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Abstract — Kolmogorov-Smirnov (K-S) test-a non-parametric 
method to measure the goodness of fit, is applied for au- 
tomatic modulation classification (AMC) in this paper. The 
basic procedure involves computing the empirical cumulative 
distribution function (ECDF) of some decision statistic derived 
from the received signal, and comparing it with the CDFs 
of the signal under each candidate modulation format. The 
K-S-based modulation classifier is first developed for AWGN 
channel, then it is applied to OFDM-SDMA systems to cancel 
multiuser interference. Regarding the complexity issue of K-S 
modulation classification, we propose a low-complexity method 
based on the robustness of the K-S classifier. Extensive simulation 
results demonstrate that compared with the traditional cumulant- 
based classifiers, the proposed K-S classifier offers superior 
classification performance and requires less number of signal 
samples (thus is fast). 

Index Terms — Automatic modulation classification, 
Kolmogorov-Smirnov test, OFDM, interference cancellation. 

I. Introduction 

Automatic modulation classification is a procedure per- 
formed at the receiver based on the received signal before 
demodulation when the modulation format is not known to the 
receiver. It plays a key role in various tactical communication 
applications. It also finds applications in emerging wireless 
communication systems that employ interference cancellation 
techniques - in order to demodulate and cancel the unknown 
interfering user's signal, its modulation format needs to be 
classified first. 

The feature-based modulation classification methods are 
popular, and they base on feature extraction and decision JT)- 
0. The most widely used feature is the cumulant. It can be 
used to classify many different modulation types by high- 
order statistic cumulants [4|. It is simple to implement and 
can achieve nearly optimal performance with large number 
of samples J6). For example, the fourth-order cumulant can 
be used to classify various low-order modulations. For clas- 
sifying higher-order constellations, a higher-order cumulant is 
needed. An accurate estimate of the higher-order cumulant of 
the signal requires a large number of signal samples. Most 
of the existing works on modulation classification focus on 
the additive white Gaussian noise (AWGN) channel. A few 
works have considered fading and multipath channels (6], 
0. However, effective modulation classifier with less signal 
samples remains a challenge. 

In this paper, we propose to employ the Kolmogorov- 
Smirnov (K-S) test [8] for modulation classification. The K- 



S test is a non-parametric statistical method to measure the 
goodness of fit. From the received signal, we compute the 
empirical cumulative distribution function (CDF) of certain 
decision statistic. A priori we also compute the CDF of 
the same decision statistic under each candidate modulation 
format. The modulation format that results in the minimum 
of the maximum distance between its CDF and the observed 
empirical CDF is the final decision. We develop K-S classifiers 
based on quadrature amplitude decision statistics, then apply it 
to OFDM-SDMA systems to cancel the multiuser interference. 
Regarding high complexity involved by CDF calculation, we 
propose a low-complexity method based on the robustness 
of K-S classified We provide extensive simulation results 
to demonstrate the performance gain of the proposed K-S 
classifiers over the cumulant-based classifiers. The remainder 
of this paper is organized as follows. In Section II we provide 
some background on modulation classification and on the K- 
S test. In Section III, we develop the K-S-based modulation 
classifiers based on K-S test, and its corresponding low- 
complexity method. Section IV is devoted to an application of 
K-S classifier in OFDM-SDMA system for interference signal 
recognition and cancellation. Simulation results are provided 
in Section V. Section VI concludes the paper. 

II. Background 

A. Automatic Modulation Classification 

Consider the following discrete-time additive white noise 
channel model 



Vn 



n = !,■■• ,N, 



(1) 



where x n , y n and w n are respectively the complex- valued 
transmitted modulation symbol, the received signal, and 
the noise sample at time n. The transmitted symbols 
{xi, ■ ■ ■ ,xn} are drawn from an unknown constellation set 
M. which in turn belongs to a set of possible modulation 
formats {.Mi, • • • ,A4k}- The modulation classification prob- 
lem refers to the determination of the constellation set A4 to 
which the transmitted symbols belong based on the received 
signals {yi,--- , J/jv}- There are two major approaches in 
the literature to solving the above modulation classification 

'it is worth to point out that during the preparation of this paper's 
presentation we discover the impressive work in [91 that was submitted 
recently and investigates low complexity issue of modulation classification. 



problem. In the likelihood-based methods iflOl , ifTTl . some 
form of the likelihood for each modulation format is calculated 
by making certain assumption on the data sequence. The 
classification decision then corresponds to the modulation 
with the largest likelihood value. These methods are typically 
computationally very expensive. Moreover, they require the 
knowledge of the various channel parameters and become 
ineffective in the presence of unknown channel impairment 
such as fading, phase and frequency offsets, and non-Gaussian 
interference/noise . 

The more popular and low-complexity approach to au- 
tomatic modulation classification is based on cumulant |4j, 
0. Specifically, for the system given by (HJ, we calculate 
the normalized sample fourth-order cumulant of the received 
signal {y n } as 

E{M 4 }-|E(y 2 )| 2 -2E 2 { M 2 } 



C 



(2) 



{E{|yP} - CT 2 } 2 

The modulation whose theoretical cumulant [10| is closest 
to C is then the classification decision. The fourth-order 
cumulants can be used to classify 4-QAM, 16-QAM and 64- 
QAM modulations. For even higher-order modulations, the 
difference between the cumulants becomes small, which leads 
to low classification accuracy. A higher-order cumulant should 
be used to classify these modulations, with a considerably 
increased computational complexity. 

B. Kolmogorov-Smirnov (K-S) Test 

The Kolmogorov-Smirnov (K-S) test is a non-parametric 
test of goodness of fit for the continuous cumulative distri- 
bution of the data samples JS), fl2l . Ifl3l . It can be used 
to approve the null hypothesis that two data populations are 
drawn from the same distribution to a certain required level 
of significance. On the other hand, failing to approve the null 
hypothesis shows that they are from different distributions. 

In this paper we consider the one sample K-S test. In the 
test, we are given a sequence of i.i.d. real-valued data samples 
Z\,Z2,--- ,zn with the underlying cumulative distribution 
function (CDF) F\(z), and a hypothesized distribution with 
the CDF Fq(z). The null hypothesis to be tested is 



Fi — Fq 



(3) 



The K-S test first forms the empirical CDF from the data 
samples 



1 N 



(4) 



where I(-) is the indicator function, which equals to one if the 
input is true, equals to zero otherwise. The largest absolute 
difference between the two CDF's is used as the goodness-of- 
fit statistic, given by 



D±sup\Fi(z)-F (z)\, 

and in practical, it is calculated by 

D = max |Fi(z„) - F (z n 

Kn<N 



(5) 
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The significance level a of the observed value D is given by 
a = P{D > D) = Q ([VN + 0.12 + 77=] -CM , (7) 



with Q(x) = 2^(-l) m -V 



(8) 



m— 1 



The hypothesis Hq is rejected at a significance level a if a = 

P{D > D) <a. 

III. K-S-based Modulation Classification 

Consider the signal model in (fl}. In this section, we assume 
that the i.i.d. noise samples follow the complex Gaussian 
distribution, i.e., w n ~ Af c (0,a 2 ); that is, the real and imag- 
inary components of w n are independent and have the same 
Gaussian distribution Af c (0, %-). To classify the modulation 
based on the received signals {y n } using the K-S test, we 
first form a sequence of decision statistics {z n } from {y n }, 
where z n can be either the magnitude, or the phase, or the 
real and imaginary components of y n , and then compute the 
corresponding empirical CDF F%. In the meantime, for each 
possible modulation candidate Aik, we can obtain either the 
CDF Fq for {z n }. The K-S statistic is then calculated by 

D= max |A(^)-^(-2n)l> k = l,2,---,K. (9) 

l<n< N 

The decision on the modulation is given by the minimum K-S 
statistic, i.e., 

k — are min Dk- (10) 

l<k<K 

Moreover, recall that associated with each K-S statistic F>k, 
there is a significance level a.k = P(D > Dk\A4k), computed 
by (0. The normalized {dfe} can be used to give a "soft" de- 
cision on the modulation, that is, the probability that the mod- 
ulation M.k is used approximately qk — &k/ 'X)«=i^; ^ = 
1, • • • ,K. In what follows, we discuss the decision statistics 
{z n } for different modulation formats, and the corresponding 
CDF F . 

We consider the quadrature amplitude modulation (QAM) 
formats, e.g., 4-QAM, 16-QAM, and 64-QAM. The set of 
signal points of unit-energy constellations for these modula- 
tions are given by AI4-QAM = {-- (a + bj)\a,b = —1, 1}, 



M 16 - 



16-QAM 



b])\a,b = -3,-1,1,3}, -M 6 4-qam 
7, —5, —3,-1, 1, 3, 5, 7}, where j 



We suggest a quadrature-based K-S classifier, which is first 
proposed in our work of 0141 . Since for QAM input signals, 
the real and imaginary components of the received signal y n 
are independent and have identical distributions, we can also 
use them directly as the decision statistics. That is, from the 
N received signals samples j/i, ■ • • ,yN, we form a sequence 
of 2A^ samples of decision statistic 



z 2n -i = z 2n = 3{y n }, n = 1, 
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Fig. 1. QAM modulation (4-QAM, 16-QAM, 64-QAM) classification Fig. 2. QAM modulation {4-QAM, 16-QAM, 64-QAM} classification 
performance versus SNR offset. SNR= 15dB. performance in AWGN channels. The number of samples N = 100. 



Then we have z n '~' M{Q, 
modulation Ai^ is given by 



-). Hence the CDF under 
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where Qo(a) is Gaussian Q-function, and ^R{A4k} denotes 
the set of real components of the signal points in Aik- The 
K-S test in <f9b - ( 1 1 Ob can be performed using Fi and {F fc } on 
the samples z\, ■ ■ ■ , Z2N- 

Due to the complicated CDF expressions, it is computa- 
tionally expensive to calculate CDF at each samples. Since 
the K-S classifier has the property of robustness, which will 
be proved in Fig. 1, i.e. the correct classification performance 
is robust to SNR mismatch. With respect to the robustness, 
we can quantize the received SNR with different granularity. 
In a certain quantization, one CDF curve is stored for each 
SNR, where the curves are also comprising of discrete points 
which are denser than the SNR interval. All the curves can be 
computed offline. With the storage of these CDF curves, we 
can avoid the complicated computation by looking up tables. 
Thus, all involved computations are only ECDF calculation 
and comparison operation. 

IV. Application: Interference Cancellation in 
OFDM-SDMA Systems 

We next consider an application of modulation classification 
in the context of interference cancellation in an OFDM system 
employing multiple receive antennas. Specifically, assuming 
the OFDM receiver is equipped with two receive antennas, 
which makes it possible to have two users simultaneously 
transmitting data - the so-called space-division multiple-access 
(SDMA) [ 1 5 1 . That is, the received signal at the £-subcarrier 
of the n-th OFDM word is given by 



Yt(n) 



HgX t {n) 
,P; 



+ GgX'gin) 
n = 1 , • ■ ■ . 



W t (n), 



N, 



(12) 



where Y E C 2xl denotes the received signals at the two 
receive antennas; H £ C 2xl denotes the channels between the 
desired user's transmitter and the receive antennas; G £ C 2xl 
denotes the channels between the interfering user's transmitter 
and the receive antennas; W ~ A/" c (0, er 2 J) is the received 
noise sample vector. 

The receiver aims to demodulate the desired user's symbols 
{Xi(n)}. It is assumed that the receiver knows the modulation 
of the desired user, but not that of the interferer. It is also 
assumed that the receiver can estimate the channels of both the 
desired user and the interferer, {Hg, Gg}. A simple receiver 
scheme is to apply a linear MMSE filter to the received 
signal Y((n) to suppress the interferer, and demodulate Xg(n) 
based on the output of this filter. A more powerful receiver 
scheme is to employ interference cancellation. That is, we 
first demodulate the interferer's symbols {X'^n)} and then 
subtract the interferer's signals from the received signals. 
Finally we demodulate the desired user's symbols based on the 
post-cancellation signals. In order to estimate the interferer's 
symbols, we must first classify the modulation format used 
by the interferer. Hence the interference cancellation receiver 
consists of the follow steps. 



For each subcarrier, apply a linear MMSE filter M# 

( v - x _1 
ae lHiH e 

press the desired signal 



Gi to the received signal to sup- 



eGf (HiHf 



a 2 I) 



with at = [G?(H £ Hf + a 2 I)- 1 G t }- 1 . (13) 

By the choice of ag in (37), we can write 

lt (n) = X e (n)+w e (n), (14) 

where wg(n) = Mf (H gXg(n) + Wg(n)) contains the 
residual desired user's signal and noise. The distribution 
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Fig. 3. QAM modulation {4-QAM, 16-QAM, 64-QAM} classification 
performance versus sample size in AWGN channels. SNR= 14dB. 



of wg(n) can be accurately modeled as Gaussian with 
zero mean and variance a 2 = \Mf Hi\ 2 + a 2 \Mi\ 2 . 

• Based on the linear MMSE filter output ( TT4"1 >. which is an 
AWGN model, we can apply the K-S classifiers discussed 
in Section [HI] to classify the modulation format for the 
l-th subcarrier group, I = 1, • • • , £. 

• Once the modulation format of the interferer on each 
subcarrier group is estimated, we can demodulate the 
interferer's symbols {X' e (n)} based on the linear MMSE 
filter output (TBI . 

• Next we perform interference cancellation on each sub- 
carrier, followed by a matched-filtering for the desired 
users's signals, i.e., 

t (n) = Hf (Y t (n) - G e X' £ (nfj (15) 

Finally the desired user's symbol Xi>(n) is demodulated 
from f3g(n). 

V. Simulation Results 

In this section, we provide simulation results to compare the 
performance of the proposed K-S-based modulation classifier 
with that of the cumulant-based one. For the QAM modula- 
tions, we will consider the set {4-QAM, 16-QAM, 64-QAM} 
in AWGN channel. The channel model is given by (1) with 
w n ~ jV c (0, a 2 ). The signal-to-noise ratio (SNR) is defined 
as 1/cr 2 . The 4-th order cumulants are used. The number of 
received signal samples used is N = 100. In Fig. 1, consider 
the SNR mismatch at the receiver. If the noise power is half 
or twice as the original value, there is 3dB or -3dB SNR 
mismatch. We compare the robustness of K-S and cumulant 
classifiers. In the whole range, K-S classifier is always much 
better than the cumulant one, although there is degradation on 
large value of SNR offset. Cumulant method is robust in the 
range, however, its performance is not satisfactory with such 
few sample number. The classification performance of various 



classifiers in AWGN channels for QAM modulations is shown 
in Fig. 2, including quantized K-S classifier. We also show 
the performance of the Hellinger-distance-based classifier [ 16 1 
which has a very high complexity. It is seen that for such a 
small sample size, at high SNR, the cumulant-based methods 
exhibit a ceiling on the classification probability around 0.8. 
However, the K-S-based classifier monotonically improves 
the classification performance as the SNR increases and it 
significantly outperforms the cumulant-based classifier at high 
SNR. The Hellinger-distance-based classifier performs worse 
than the cumulant method in the low SNR region and in the 
high SNR region it performs worse than the K-S quadrature 
method. As is shown in Fig. 2, the performance suffers more 
as the SNR quantized interval increases, where the quantized 
interval of each CDF curve is 0.01 and the scale is between -4 
to 4, and larger values are ignored since they are with trivial 
probability. With respect to the performance degradation at - 
3dB and 3dB SNR mismatch in Fig. 1, K-S classifier is still 
superior to the other two classifiers even with 5dB quantized 
interval. The classification performance for QAM modulations 
as a function of the sample size is shown in Fig. 3. 



We next consider the effect of modulation classification 
on the performance of interference cancellation in an OFDM 
system employing multiple receive antennas. The signal model 
is given by (TTZb . Again the 3GPP channel model is used to 
generate the multipath channels for multiple antennas ftVT] , 
ff8l . There are 512 subcarriers in one OFDM symbol, which 
are shared by both the desired user and the interferer. For 
simplicity we assume that the same QAM modulation is 
employed on all subcarriers for each user. Hence modulation 
classification is based on samples from one OFDM symbol. 
We assume that the channels of both the desired user and the 
interferer are known. In Fig. 4 and Fig. 5 we compare the bit 
error rate (BER) performance of four receivers, namely, the 
linear MMSE receiver, the interference cancellation receivers 
using the K-S modulation classifier and the cumulant-based 
modulation classifier, respectively, and an "ideal" receiver that 
is assumed capable of completely removing the interferer's 
signal (and hence achieving single-user performance.) Note 
that the last receiver performance serves as a lower bound 
to the performance of any practical receiver. In Fig. 4 the 
desired user employs 16-QAM whereas in Fig. 5 the desired 
user employs 64-QAM. It is seen that performance of the 
receiver that uses the cumulant-based classifier is even worse 
than that of the linear MMSE receiver. This is because even 
with a sample size of 512, the cumulant-based classifier 
has a relatively low accuracy in detecting the modulation 
and with the wrong modulation information, the interference 
cancellation receiver actually enhances the interference. On the 
other hand, the receiver that employs the K-S classifier exhibits 
performance that is close to the ideal receiver performance, 
and offer a gain of 2dB and 3dB respectively at the BER of 
0.01 compared to the linear MMSE receiver. 
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Fig. 4. BER performance of OFDM-SDMA receivers. The desired user 
employs the 16-QAM modulation; and the interferer employs modulations 
from {4-QAM, 16-QAM, 64-QAM}. 



Fig. 5. BER performance of OFDM-SDMA receivers. The desired user 
employs the 64-QAM modulation; and the interferer employs modulations 
from {4-QAM, 16-QAM, 64-QAM}. 



VI. Conclusion 

We have proposed a new modulation classification technique 
based on the Kolmogorov-Smirnov (K-S) test, for classifying 
different QAM modulation formats. The basic procedure in- 
volves computing the ECDF of some decision statistic derived 
from the received signal, and comparing it with the CDFs of 
the signal under each candidate modulation format. Compared 
with the popular cumulant-based modulation classifiers, the 
proposed K-S classifiers offer faster (i.e., requiring less num- 
ber of signal samples) and superior performance. Regarding 
the complexity issue, we propose the low complexity method 
based on the robustness of K-S classifier. Moreover, the K- 
S classifier offers a method of interference cancellation in 
OFDM-SDMA systems. 
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